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Abstract. A word w is called synchronizing (recurrent, reset, directed) 
word of a deterministic finite automaton (DFA) if w sends all states 
of the automaton on a unique state. Jan Cerny had found in 1964 a 
sequence of n-state complete DFA with shortest synchronizing word of 
length {n — 1)^. He had conjectured that it is an upper bound for the 
length of the shortest synchronizing word for any n-state complete DFA. 
The examples of DFA with shortest synchronizing word of length (n— 1)^ 
are relatively rare. To the Cerny sequence were added in all examples of 
Cerny, Piricka and Rosenauerova (1971), of Kari (2001) and of Roman 
(2004). 

By help of a program based on some effective algorithms, a wide class of 
automata of size less than 11 was checked. The order of the algorithm 
finding synchronizing word is quadratic for overwhelming majority of 
known to date automata. Some new examples of n-state DFA with mini- 
mal synchronizing word of length (n — 1)^ were discovered. The program 
recognized some remarkable trends concerning the length of the minimal 
synchronizing word. 
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Introduction 

We consider a DFA with complete state transition graph F and transition semi- 
group S over alphabet S. Let n be the size of DFA and q be the size of E. 
The problem of synchronization of DFA is natural and various aspects of this 
problem were touched upon the literature. Synchronization makes the behavior 
of an automaton resistant against input errors since, after detection of an error, 
a synchronizing word can reset the automaton back to its original state, as if 
no error had occurred. Therefore different problems of synchronization draw the 
attention. 
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A problem with a long story is the estimation of the minimal length of synchro- 
nizing word. Most known as a Cerny conjecture, it was aroused independently 
by distinct authors. Jan Cerny had found in 1964 n-state complete DFA with 
shortest synchronizing word of length (n — 1)^ for g = 2. He had conjectured that 
it is an upper bound for the length of the shortest synchronizing word for any 
n-state complete DFA. The problem can be reduced to automata with strongly 
connected graph 2J. The best known upper bound is now equal to (n^ — n)/Q 
[S], [5], [S], [12 . The conjecture holds true for a lot of automata, but in general 
the problem remains open. This simply looking conjecture is now one of the 
most longstanding open problems in the theory of finite automata. Moreover, 
the examples of automata with shortest synchronizing word of length {n — 1)^ 
are infrequent. After the sequence found by Cerny and example of Cerny, Piricka 
and Rosenauerova 3^ of 1971 for q — 2, the next such example was found by 
Kari [5] only in 2001 for n = 6 and q = 2. Roman [T3] had found an analogical 
example for n — 5 and q = 3 in 2004. There are no examples of automata for 
the time being such that the length of the shortest synchronizing word is greater 
than (n — 1)^. 

The testing of synchronizing automata is an indispensable part of investigation 
in this area [T], [J], [10], [TT], [T3], [Hj. The best known to date algorithm of 
Eppstein [TD] improves an algorithm of Natarjan [TT] and finds a synchro- 
nizing word for n-state DFA in 0{n^ + ri^q) time. 

We present a new efficient algorithm for finding a synchronizing word. The ac- 
tual running time of the algorithm on a lot of examples proved to be essentially 
less than in case of 0{n'^q) time complexity. For clear majority of automata, the 
time complexity is 0{n^q). It gives a chance to extend noticeably the class of 
considered DFA. This algorithm plays a central role in the program for search 
of automata with minimal reset word. 

The program studied all automata with strongly connected transition graph of 
size n < 10 for q = 2 and of size n < 7 for g < 4. All known and some new 
examples of DFA with shortest synchronizing word of length [n — 1)^ from this 
class of automata were checked. So all examples of DFA with shortest synchro- 
nizing word of length (n — 1)^ in this area are known for today. The size of the 
alphabet of the examples is two or three. The situation in the neighborhood of 
the bound {n — 1)^ of Cerny (minimal reset words of relatively great length) was 
also studied. 

There are no contradictory examples for the Cerny conjecture in this class of au- 
tomata. Moreover, the program does not find new examples of DFA with reset 
word of length (n — 1)^ for automata with n > 4 as well as for q > 3. No such 
examples exist for alphabet of size four if n < 7. 

And what is more, the examples with minimal length of reset word disappear 
even for values near the Cerny bound (n — 1)^ with growth of the size of the 
automaton as well as of the size of the alphabet. The gap between {n — 1)^ and 
the nearest of the minimal lengths of reset word appears for n = 6. There are 
no 6-state automata with minimal length of synchronizing word of 24 for q < 4. 
The following table displays this interesting trend for the length of minimal reset 



words less than [n — 1)^. 
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The program uses also straightforward algorithm for finding synchronizing word 
of minimal length. A help algorithm of the program verifies whether or not a 
given DFA is synchronizing. It is a modification of an algorithm of 0{n^q) time 
complexity supposed by Eppstein [10] . Our version has 0{TT?q) time com- 
plexity only in the worst case and we use usually only its linear part. 
The comparison of the experimental data suggests that the length of the syn- 
chronizing word found by central algorithm of the program is not far from the 
length of the minimal synchronizing word. This length was not greater than n? 
in all billions cases studied for today. The results of the algorithms altogether 
correspond to the Cerny conjecture. All above algorithms are implemented in 
our package TESTAS [TO]. 



Preliminaries 

Let us consider a deterministic finite automaton with state transition graph F 
and transition semigroup S over alphabet S. The states of the automaton are 
considered below as vertices of the transition graph F. 
The number of vertices of the graph F is denoted by \F\. 

A maximal strongly connected component of a directed graph will be denoted 
for brevity as SCC. 

If there exists a path v G 17+ from vertex p to vertex q in the transition graph 
of DFA then let us denote the vertex q as pw. 

Let Fv denote the mapping of the graph [automaton] F by help of w G , let 
us call \Fv\ rank and \F\ — \Fv\ defect of the mapping v. 

A word V G is called synchronizing word of an automaton A with transition 

graph F if \Fv\ = 1. An automaton (and its transition graph) possessing a 

synchronizing word is called synchronizing. 

A word w is called 2-reset word of the pair p, q if ptc = qw. 

Suppose p ^ q if pui = q for some word w. 

A state [a vertex] q is called sink of an automaton [of a graph] if p ^ q for all p. 
An automaton [a graph F] is called complete if for every state [vertex] p and 
every tr G ^ the state [vertex] per exists. 

The direct product F^ of two copies of graph 7^ over an alphabet S consists of 
vertices (p, q) and edges (p, q) (per, qu) labelled by a. Here p, q G -T, cr G 

1 Some auxiliary properties 

Two following two simple lemmas belong rather to the folklore. 



Lemma 1 IIBH, The directed labelled graph F is synchronizing if and 
only if has sink state. 

Lemma 2 The sets of synchronizing words of the graphs F and coincide. 

Lemma 3 Suppose p ^ Fs for a word s and a state p of transition graph F of 
DFA. 

Then there exist two minimal integer k and r such that ps'" = ps'°+'" . The pair of 
states p, ps'' has 2-reset word s*^ and for every i < k the pair of states ps', ps''"'"* 
has 2-reset word s^^^ . The word s^ is a 2-reset word for at least k different pairs 
of states. 

In the case r = 1 every pair of states ps', ps*^ for every i < k has 2-reset word 
s^-\ 

Proof. The sequence ps, ps^, ps*, ... is finite and belongs to Fs. Therefore 
such k and r exist. Two states ps' and ps''+' are mapped by the power s''^* on 
ps*^ = ps'^"'"'" as well as the states p and ps*" are mapped by the power s^ on 
ps*"'. All states ps* are distinct for i < k, whence the word s*^ unites at least k 
distinct pairs of states. 

In the case r = 1, two states ps* and ps'^ are mapped by the word s'^"* on 
ps*"' = ps'^+-'^ as well as the pair of states p, ps*"' is mapped by the power s'' on 
ps*^'. All states ps* are distinct for i < k, whence the word s'' unites also in this 
case at least k distinct pairs of states. 

Lemma 4 Suppose ra = ta for a letter a and two distinct states r, t of transi- 
tion graph F of DFA and let the states r and ra be consecutive states of a cycle 
C ofF. 

Then there exists a word s of length of the cycle C such that rs — r and 
\Fs\ < \F\. For some state p £ F \ Fs there exists a minimal integer k such 
that ps^ = ps*^"*"^. The pair of states p, ps*^ has 2-reset word s'^ and for every 
i < k the pair of states ps*,ps*'' has 2-reset word s^~'^ . The word s^ unites at 
least fc + 1 distinct states. 

Proof. A word s with first letter a can be obtained from consecutive letters on 
the edges of the cycle C. Therefore |s| is equal to the length of the cycle and 
rs — Y. \Fs\ < \F\ follows from ra — ta. 

From rs = r ^ t and ra — ta follows that ts = r 7^ t and ts* ^ t for any integer 
i. In the case t e F\Fs suppose p = t, and so the state p is defined. In opposite 
case for some state ti holds tis — t. If ti e F\Fs suppose p = ti, else for some 
state t2 holds t2S^ = t. Let us continue this procedure until tfc_i G F \ Fs for 
some k such that tfc-is'^^^ = t. Such minimal fc exists and all states t, ti, 
tj for j < k are distinct because ts* 7^ t for any integer i. The state t therefore 
has a preimage p = tk-i in F\Fs by mapping s'^"^, whence ps'^ = ps*^"*"^ = r. 
So the pair of states p, ps'^ has 2-reset word s'^ and for every i < k the pair of 
states ps^ps*^ has 2-reset word s*^"*. The states ps* for j < fc and p are dis- 
tinct because of the choice of k. The word s'^ maps all these states on the state r. 



Obvious is the following 



Lemma 5 Suppose qs = q for m states q from F and for some word s such 
that s*" = s''+^. Then \rs''\ = m. 

2 Synchronizing Algorithms 

The following help construction was supposed by Eppstein [3]. Let us keep for 
any pair of states r, q the first letter a of the minimal 2-reset word w of the 
pair of states together with the length of the word w. The corresponding letter 
of the pair of states ra, qa is the second letter of w. The 2-reset word w of 
minimal length can be restored on this way. The time and space complexity of 
this preprocessing is Od-T^j) |1| and it will be used in majority of considered 
algorithms. 

A help algorithm with 0{\r\'^q) time complexity in the worst case based on 
Lemmas [T] and [2] verifies whether or not a given DFA is synchronizing [4] , |19j . 
The main part of the algorithm follows [4] (see also [10]). Our modification of the 
algorithm finds first all SCC of the graph (a linear algorithm) and then checks 
the minimal SCC of the graph (if exists). The program for search of automata 
with relatively great minimal reset word uses this algorithm on the preliminary 
(and quite often linear) stage. 

An efficient semigroup algorithm, essential improvement of the algorithm from 
[4j , based on the properties of syntactic semigroup and inspired by Lemmas [3] - 
[5] is used on the next stage and plays a central role in the program. 

2.1 A semigroup algorithm for synchronizing word 

We consider the square and the reverse graph / of F. The graph / is not 
deterministic for synchronizing graph F. 

Suppose that the graph F is synchronizing, all sink states are found on the stage 
of checking of the synchronizability, the graph F^ and the reverse graph I were 
build. 

Let us find by help of the reverse graph / for any pair of states r, q from F"^ the 
first letter of the minimal 2-reset word w of the pair and the length of u; [3^ ■ 
for any pair of states (r, q) can be restored a 2-reset word w of minimal length. 
The set of states (r, q) can be ordered according to the length of the word w. The 
ordering can be made linear in the size of the set. One can find first the number 
of all pairs (r, q) with given length of minimal 2-reset word for any length, then 
adjust an interval for to place the pairs and then allocate the pairs of states in 
the interval according to the value of the length. 

We use also an another idea for to reorder the pairs of states. The number of 
preimages of the state rw = qw by mapping w'^ for any integer k can be used 
for the ordering together with the length \w\. Let us call this order the second. 
The number of preimages can be found in linear time for given pair of states 



(r, q) using the reverse graph I. The corresponding words may form a set of 
generators of a subsemigroup of the semigroup A of aU reset words and we will 
use only linear number of pairs studied for this aim. 

The important part of the preprocessing supposed by Eppstein was the comput- 
ing of the mapping Fw of the graph F induced by the minimal 2-reset word w of 
the pair of states r, q. This stage begins from the shortest words w and therefore 
is linear for any considered pair of states r, q. Nevertheless, the time complexity 
of the stage is 0{F'^). For to avoid the extremes of this step, our algorithm stops 
on linear number of pairs. The obtained set G of 2-reset words is considered as 
a set of generators of some subsemigroup from A and will be marked together 
with corresponding pairs of states. The time complexity of this step is therefore 
0(i^^). Let us reorder G in the second order and use the mapping of the graph 
induced by powers of generators. 

Let Fi be consecutive images of the graph F = Fq such that for Wi € A holds 
FiWi+i = ri+i and \Fi\ > l-Tj+il- Let Ai be a semigroup generated by the set 
wi, ... Wi. Let us check pairs of states corresponding to the words from G. If the 
pair belongs to Fi then the corresponding minimal reset word Wi+i may be used 
for to find the image /i+i. 

In the case no minimal 2-reset word of a pair from Fi was marked, let us consider 
the products of marked words. If some product unites a pairs of states of Fi, 
then let us use the mapping, mark the product of words and the pair of states. 
Let us notice that on this step are considered not all marked pairs. The number 
of considered products must be linear in the size of F. The product of two map- 
pings can be found in linear time. Therefore the time complexity of this stage is 
0{\F\k) for the defect k of the mapping of Fi. 

If two considered stages still do not find a reset word, then the new genera- 
tor must be added to considered subsemigroup Ai. Let us take a pair of states 
r, q from Fi with reset word w,. Suppose Wi = UiVi such that the word u, was 
marked. Then the mapping Wi can be found in irHujl time. Let us notice that 
only on this step the time complexity may by greater than quadratic. 

Lemma 6 Let Fi be consecutive images of the graph F = Fq such that for vt 
from semigroup A FiVi+i = -Tj+i, l-Tjl > l-fi+il and \Fs\ = 1 for some integer s. 
Let Ai he a semigroup generated by the set wi, ... Wi such that Wi = UiVi is a 
reset word for some pair of states from Fi-i and Vi is a marked element of the 
subsemigroup 

Then the considered algorithm has max{0{\F\'^q),0{\F\\u-i...Us\) time complex- 
ity. 

Proof. The time complexity of the step of the building of F"^ is 0{\F\'^q). So 

0(\F\'^q) is a lower boimd for the complexity of the considered algorithm. 

Let the set wi, ... Wi generate Ai . The creation of the mapping Wi needs | r' 1 1 | + 1 

steps because for the marked element Vi the mapping is known. 

The element will be marked and used only if it is either a generator from Ai or 

a product of two marked elements. With a marked semigroup element will be 

associated the mapping of F defined by the element. The finding of the mapping 



of the product of two elements with known images is linear in the size of the 
graph. 

We repeat the process with the obtained image Fi. The defect of the mapping is 
growing on every step. After not over than \r\ — 1 steps F will be synchronized. 
The process of recording of the synchronizing word is linear in the length of the 
word. The length of the synchronizing word found by the algorithm in billions 
of practical experiments was less than \F\'^ in all considered cases. The stage 
of adding of new generators was used only in a small number of cases, only 
some percents of considered synchronizing automata. The minimal number of 
generators of the semigroup A is usually small. For instance, for all Cerny graphs 
there are only two generators. Therefore the time complexity of the algorithm is 
0(\F\'^q) in overwhelming majority of cases and the algorithm can be considered 
as almost quadratic. 

2.2 Modification of Eppstein algoritlim 

Some version of the program uses also a modification of Eppstein algorithm [4] , 
[10] for finding synchronizing word of 0(|/^p + time complexity. The fa- 

vorable idea of Eppstein was to keep with any pair of states r, q the first letter 
of the minimal reset word w, its length and the image of the set of states by help 
of the mapping induced by the word w. The building of the images has O(lr'l^) 
time complexity and is a most wasteful part of the algorithm. 
Our modification of the Eppstein algorithm (called below a cycle algorithm) in- 
stead of a word w considers a power of this word until stabilization of the rank 
of the image. It proved to be fruitful in many cases including such extraordinary 
case as graphs of Cerny [2]. The length of the reset word obtained by the al- 
gorithm in this case reaches its minimum. We omit sometimes this stage of the 
program despite the growing number of the graphs studied on the next stage. 
Nevertheless, the observation period of the whole of the program is essentially 
smaller in spite of the fact that the next stage is non-polynomial. 

Theorem 7 J^, \BI Let C he set of size k and let us consider a sequence of 
its subsets Ci of size m such that any d includes a two-element subset of C 
not included in every Cj for j < i. Then the length of the sequence is less than 
(fc-m-|-2)=(=(fc-m + l)/2. 

Corollary 8 Let F be transition graph of an automaton with \F\ states and let 
us consider a sequence of subsets Ci of states of the automaton of size m or less 
such that any Ci includes a two- element subset of states of F not included in 
every Cj for j < i. Suppose the length of the sequence is {\F\ — m + 2) * {\F\ — 
m -\- l)/2. Then at least one Ci contains less than m states. Any sequence of 
length {\F\^ — |-r|)/6 of considered kind for distinct m contains a set of size one. 

The value — |F|)/6 is well known and was mentioned time and again [5], [8], 
[9] , [12] . The combinatorial theorem [7] can be used for estimation of the length 



of the reset word obtained by Eppstein, cycle and semigroup algorithms. The 
theorem considers distinct mappings of the graph of the automaton induced by 
the letters of the alphabet of the labels such that any new mapping has at least 
one pair of states that does not belong to any previous mapping of the same 
rank. For given rank k of mapping in considered algorithms there are at most 
{\r\ + k){\r\ + /c — l)/2 or less than \r\ distinct mappings. The pair of states 
with a most short reset word creates a sequence of such mappings and therefore 
the theorem [7] can be used here. Corollary [H] implies 

Proposition 9 The length of the reset word obtained by Eppstein, cycle and 
semigroup algorithms is less than (l-Tp — |-r|)/6. 

So the time complexity of the algorithm in the most worst case is 0{\^\^^q). 
Really this most worst case is very rare, for all automata studied for today by 
these algorithms, it was less than 

2.3 An algorithm for finding synchronizing word of minimal length 

On the last stage, the program uses a straightforward algorithm for finding syn- 
chronizing word of minimal length. The last one is not polynomial in the most 
worst case (the finding of the synchronizing word of minimal length is NP-hard 
[1] ) [in] ) [H] ) • The program for search of minimal reset word uses this algorithm 
relatively rare. 

The algorithm is a revision of an algorithm for finding the syntactic semigroup 
S of size s with q generators on the base of transition graph [T7] . We find map- 
pings of the graph of the automaton induced by the letters of the alphabet of the 
labels. Mappings with the same set of states are identified. It essentially simpli- 
fied the process in comparison with the algorithm from [17j . Distinct mappings 
are saved. For this aim, any two mappings must to be compared, so we have 
0{s{s — l)/2) steps. Let us notice that the size of the syntactic semigroup is in 
general not polynomial in the size of the transition graph. 

The mappings correspond to semigroup elements. With any mapping let us con- 
nect a previous mapping and the letter that creates the mapping. On this way, 
the path on the graph of the automaton can be constructed. 

Proposition 10 The algorithm finds a list of all words (elements of syntactic 
semigroup ) of length k where k is growing. The first synchronizing word of the 
list has minimal length. 

The time complexity of the considered procedure is 0{\r\qs'^) with 0{\r\s) space 
complexity. 

2.4 Checking synchronizability 

The algorithm is based on the Lemma [T] and presents a modification of an algo- 
rithm from [4]. 

First let us check SCC using the first-depth search and find the SCC Fg of sink 



states from F. If there are no sink state then the graph has no synchronizing 
word and the algorithm stops. Exactly one sink state implies synchronizability 
and the algorithm also stops. The time and space complexity of these step are 
linear. Now we can consider the graph Fs with at least two sink states. 
The next step is the consideration of F^. We unite any pair of states (p, q) and 
(q, p), all states (r, r) are united in one state (0, 0). Then let us mark sink state 
(0, 0) and all ancestors of (0, 0) using the first-depth search on the reverse of 
the obtained graph G. The graph F is synchronizing if any node of G will be 
marked. The time and space complexity of the algorithm in the most worst case 



3 Experimental data 



The considered synchronization algorithms were used in a program for search of 
automata with minimal reset word of relatively great length. The program has 
investigated all complete DFA for n < 10, q = 2 and ioi n < 7, q < 4. 
An automaton with k states outside sink SCC A of the transition graph can be 
mapped on A by word of length not greater than k(k — l)/2. Therefore only au- 
tomata with strongly connected transition graphs need investigation. The graphs 
with synchronizing proper subgraph obtained by moving off letters from the al- 
phabet are omitted too. The program reduced also the number of studied iso- 
morphic copies of automata. The case of n = 2 is not considered because any 
synchronizing automaton with two states has reset word of length (n — 1)^ ~ 1. 



The known rt-state automata with minimal reset word of length {n 



are 



presented by sequence of Cerny 




by automata supposed by Cerny, Piricka and Rosenauerova [3] (CPR), by Kari 
[B] and Roman [T^ . 






Our program has found five new following examples on the border (n — 1)^. The 
loops of the complete graphs are omitted here for simplicity. 



a a a b a 

The corresponding reset words of minimal length are: abcacabca, acbaaacba, baab, 
acba, bacb. All considered algorithms have found the same reset word for every 
example. The size of the syntactic semigroup found by the package TESTAS is 
148, 180, 24, 27 and 27 correspondingly. 

No doubts that some automata from this list, especially for ri = 3, were some- 
times studied by specialists, but wc have not found any mention of. 
There are no contradictory examples for the Cerny conjecture in considered class 
of automata. Moreover, the program does not find new examples of automata 
with reset word of length (n — 1)^ for n > 4 and g > 3. 

And what is more, the examples with minimal length of reset word disappear 
even for values near the Cerny bound (n — 1)^ with growth of the size of the 
automaton. The gap appears for n = 6. There are no 6-state automata with 
minimal length of synchronizing word equal to 24 for q < 4. 
The following table displays this noteworthy trend for the maximum of lengths 
of minimal reset words. The mentioned above examples on the Cerny border are 
not taken in account in the third line of the table. 
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The gap between (n — 1)^ and the length of the minimal reset word grows with 
n. This growing gap supports the following funny 

Conjecture 1 The set ofn-state DFA {n > 2) with minimal reset word of length 
(n — 1)^ contains only the sequence of Cerny and the eight automata mentioned 
above, three of size 3, three of size 4, one of size 5 and one of size 6. 



Let us consider the synchronization algorithms from the package TESTAS on 
some above-mentioned objects and on a modification [TSj of a graph KMM sup- 
posed by Kim, McNaughton, McCloskey [7j. 

b 

O- 
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p t 

Complete closure KMML of this graph is obtained from KMM by adding loops 
in all necessary cases. The n-state automata supposed by Cerny will be denoted 



by C< n >. 

The following table presents the name of the automaton, the number of its states, 
the size of the syntactic semigroup, the length of synchronizing word found by 
the Eppstein algorithm [4J, by the cycle and the semigroup algorithm, by the 
minimal synchronizing word algorithm with the corresponding number of map- 
pings of the set of states. 
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One can compare the results of the algorithms. Equality of the length of minimal 
synchronizing word and of synchronizing word found by the semigroup algorithm 
and by Eppstein or cycle algorithm holds in some cases. In particular, it's true 
even for such extreme objects as Cerny automata. Moreover, we obtain not in- 
frequently the same synchronizing words. The transition semigroup of the Cerny 
automaton has a nilpotent element of order n — 1, and the minimal synchronizing 
word of the automaton is a subword of a power of this element. 
As for the size of the syntactic semigroup from the table, the most discouraging 
example gives us the Kari automaton. The size of the syntactic semigroup of 
the Cerny automaton is very great too, it is about 0(2^"). Maximal size n" of 
the syntactic semigroup is reached for the examples of n = 3, q = 3. It is the 
semigroup of all transformations of 3-element set. 
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